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Abstract 

A real polynomial P(Xi, . . . , X n ) sign represents / : A n — > {0, 1} if for every (a%, . . . , a n ) G A n , 
the sign of P(a\, . . . , a n ) equals (— l)/( a i>---><»»). Such sign representations are well-studied in computer 
science and have applications to computational complexity and computational learning theory. The 
work in this area aims to determine the minimum degree and sparsity possible for a polynomial that 
sign represents a function /. While the degree of such polynomials is relatively well-understood, far less 
is known about their sparsity. Known bounds apply only to the cases where A = {0, 1} or A = { — 1, +1}. 

In this work, we present a systematic study of tradeoffs between degree and sparsity of sign repre- 
sentations through the lens of the parity function. We attempt to prove bounds that hold for any choice 
of set A. We show that sign representing parity over {0, . . . , m — 1}" with the degree in each variable at 
most m — 1 requires sparsity at least m n . We show that a tradeoff exists between sparsity and degree, 
by exhibiting a sign representation that has higher degree but lower sparsity. We show a lower bound of 
n(m — 2) + 1 on the sparsity of polynomials of any degree representing parity over {0, . . . , m — 1}". We 
prove exact bounds on the sparsity of such polynomials for any two element subset A. The main tool 
used is Descartes' Rule of Signs, a classical result in algebra, relating the sparsity of a polynomial to its 
number of real roots. 

As an application, we use bounds on sparsity to derive circuit lower bounds for depth-two AND-OR- 
NOT circuits with a Threshold Gate at the top. We use this to give a simple proof that such circuits 
need size 1.5" to compute parity, which improves the previous bound of |"^ 2 |Gol97j . We show a tight 
lower bound of 2™ for the inner product function over {0, 1}™ x {0, 1}™. 



'Supported in part by an NSF Career Award 0133597 and a Sloan Foundation Fellowship 
f Also with Telcordia. Supported in part by NSF CCR-0002299 



1 Introduction 



Let A be a subset of Z and let / : A n — > {0, 1} be a function on vl n . 

Definition 1.1 ^4 polynomial P(Xi, . . . ,X n ) G • • • ,X n ] exactly represents f over A n if for every 

(oi,...,a n ) G A n , P(ai,...,a n ) = f(a 1 ,...,a n ). 

Exact representations of functions by polynomials have been studied extensively in computer science, 
where they have numerous applications in circuit lower bounds[Raz87, Smo87j, hardness of approximation 
[HasOl] and computational learning [MOS03J. In these applications, the set A is generally taken to be 



In this paper, we study a less strict notion of representation of a function by a polynomial, which is 
called sign representation. 

Definition 1.2 [MP68] A polynomial P(Xi, . . . , X n ) £ R[-X~i, . . . , X n ] sign represents f over A n if for 
every (a±, . . . , a n ) G A n , 



Such polynomials are also referred to as Polynomial Threshold Functions for / or Perceptrons. Sign 
representations have been studied in computational complexity theory, where they were used by Beigel, 
Reingold and Spielman to show that the complexity class PP is closed under complement [BRS95J. Beigel 
et al. use such representations to show lower bounds on AC°[BRS91]. We refer the reader to the survey 
by Beigel on applications of such polynomials in complexity theory [Bei93| . 

Further motivation for studying sign representations comes from Valiant's PAC model for computational 
learning [KV94] . If a class of functions on n variables can be sign represented by degree d polynomials, then 
that class can be learnt in time in the PAC-learning model (see [KS01| for a precise statement of this 
result). Indeed, the best known algorithms for PAC-learning central concept classes like DNF formulas and 
intersections of halfspaces use this approach [KS01, KOSQ2]. For this application, there are two parameters 
of interest: the degree of the polynomial and the size of its coefficients. The former determines the running 
time of the algorithm, whereas the latter determines the number of samples required [KS04] . 

While much of the work on computational learning focuses on the Boolean case where the set A is 
taken to be {— 1,+1} or {0, 1}, it is quite natural to consider classes of functions such as decision trees 
when the variables take values from larger sets, especially {0, . . . ,m}. This is explicitly stated as an open 
problem in [MOS03J for a class of functions called juntas which are functions that depend only on some 
unknown subset of X\, . . . , X n of size k where k = 0(log n). It is well-known and easy to show that juntas 
are a special class of decision trees [MOS03J. 

Finally, we will consider an even weaker notion of representing a function by a polynomial which is 
called weak representation, introduced by Aspnes, Beigel, Furst and Rudich. 

Definition 1.3 [ABFR94I A polynomial P(Xi, . . . , X n ) G . . . , X n ] weakly sign represents f over A n 

if for every (a 1 , . . . , a n ) G A n , 



{-!,+!} or {0,1}. 



fiflx, ■ ■ ■ , a n 
f(ai, . . . , a n 



=> P(a 1 ,...,a n ) > 0, 

1 => P(ai,...,an) < 0. 



f(ax,...,a n ) =0 P(a 1 ,...,a n ) >0, 
f(ai, . . . ,o„) = 1 P(ai, . . . ,a n ) < 0. 



and further P{X\, . . . , X n ) does not vanish over the set A n . 
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Weak sign representations (or weak representations for short) have been used in computational complexity 
to show circuit lower bounds. Aspnes et al. [ABFR94] and Klivans [KliOlj use weak representations to 
show that the parity function cannot even be approximated by AC circuits. 

Finally, polynomial representations have been studied as a restricted algebraic model of computation. 
This gives rise to some natural complexity measures, namely the minimum degree and sparsity needed to 
represent a function. These measures, and tradeoffs between them have been investigated previously by 
many researchers [MP681 IBei93l IKP951 IOS03al IOS03bl iKgUTl 1X50411X0502] , Polynomial representations 
have also been studied over finite fields and rings of positive characteristic. This study has yielded useful 
insights into computational complexity [Raz87} ISmo871 IBBR94] . computational learning [MOS03] and 
combinatorics [GroOOl lGro02l |Gop06| . 

1.1 Our Results 

Definition 1.4 For AdTL, the 'parity function PAR : A n — > {0, 1} is defined as 



We will restrict our attention to the setting where A consists of non-negative integers, though our meth- 
ods can be applied to arbitrary sets A C Z. We define the sparsity sp(P) of a polynomial P(X\, . . . , X n ) 
to be the number of monomials in its support when the polynomial is written in the standard monomial 
basis. 

In this work, we present a systematic study of tradeoffs between degree and sparsity of sign repre- 
sentations through the lens of the parity function. Our methods also apply to related functions such 
as inner-product mod 2 (see Definition 16.70 . While tradeoffs between degree and sparsity have been in- 
vestigated by several researchers [Bei941 IKP951 IKS04| . previous work focused on the case A = {0, 1} or 
A = {—1, +1}- In contrast, we attempt to prove bounds that hold for any choice of set A. To motivate this, 
consider the problem of representing Boolean functions on the n-dimensional hypercube by polynomials. 
One could identify the hypercube with the set {a, b} n for any a / 6 £ 1. Indeed, this freedom to choose 
the set A is crucially used by the algorithm of Mossel et al.for learning juntas [MOS03]. Thus it is natural 
to study polynomial representations for arbitrary sets A. While it is known that the minimum degree of 
polynomials representing a function does not depend on the choice of a and b, it is unclear how this affects 
other parameters such as coefficient-size and sparsity. 

Obtaining bounds in this general setting is challenging unlike degree, the minimum sparsity of poly- 
nomials representing a function is known to vary greatly with the choice of the set A. However, we show 
that one can completely classify the minimum sparsity required to represent parity for any set A of size 
2. We obtain non-trivial lower bounds on the sparsity for arbitrary sets A of any size. We obtain tight 
sparsity lower bounds if we assume upper bounds on the degree of the polynomial. We show that there are 
tradeoffs between the degree and the sparsity of sign representations. Below we present exact statements 
of our main results. 

We reprove the result of Minsky and Papert [MP68j that any polynomial that sign represents parity 
over {0, 1}™ has degree n and sparsity 2 n . We generalize this to show that representing parity over 
{0, . . . , m — l} n with the degree in each variable at most m — 1 requires sparsity m n . This result shows that 
low degree representations must have high sparsity. We show a tradeoff between degree and sparsity by 
exhibiting sign representations of lower sparsity but higher degree. We show a lower bound of n(m — 2) + 1 
on the sparsity for polynomials of any degree representing parity over {0, . . . ,m — l} n . This allows us to 
prove tight upper and lower bounds for the case \A\ = 2. For large sets A, we are unable to close the gap 
between our upper and lower bounds. 



n 




(mod 2). 
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Our results indicate that studying sparsity gives useful insights into sign representations. For instance, 
consider the polynomials sign-representing / : {0, l} n — ► {0, 1}. If we place the restriction that each 
variable appears with degree at most 1, there is a unique polynomial that exactly represents every 
function /. However even with this restriction, the polynomials that sign represent a function are not 
unique: for any < a < b, the polynomial niLi( a ~~ bXi) sign represents parity over {0, l} n . We show 
that in any sign representation of parity, the sign of the coefficient corresponding to the monomial Ilies X% 
must be (— l)' 5 ', as in the polynomial above. Thus all sign representations have some similar structure. 

As an application of our methods, we show that lower bounds on the sparsity of sign representations 
can be used to prove circuit lower bounds for Thresholds of Ands circuits [Gol97] (see Definition I6.2[) . We 
give a simple proof that any such circuit for parity requires size at least (§) n . The best bound known 
previously was (§)2\ We also show a lower bound of 2 n for computing the inner product function over 
{0, l} n x {0, l} n which is tight. While our methods are elementary, they give better lower bounds than 
those obtained by using the powerful random restriction method [Gol97]. Our sparsity-based approach also 
differs from most previous results which related the degree of sign-representations to the size of small depth 
circuits [ABFR94, BRS91]. Building on our work, Amano and Maruoka recently used LP-based methods 
to prove lower bounds on circuits comprising on Thresholds of symmetric gates that compute the Inner 
Product function [AM05) . 

1.2 Our Techniques 

The degree of sign representations is better understood than the sparsity for a couple of reasons. Degree 
is less dependent than sparsity on the choice of A [MOS03J. A tool which helps in studying the degree 
of polynomials sign representing symmetric functions is symmetrization [MP68] : we can assume that the 
minimum degree polynomial sign representing a symmetric Boolean function is symmetric. Further, if 
\A\ = m, we can assume that the minimum degree polynomial representing a function has degree at most 
m — 1 in each variable. However, such assumptions cannot be made in the context of sparsity. 

Our main technical contribution is to show that non-trivial lower bounds on the sparsity of sign- 
representations can be obtained using some elementary techniques and a classical result from algebra 
called Descartes' Rule of Signs. Unlike over algebraically closed fields, Descrates' rule of signs shows 
that the number of real roots of a univariate real polynomial can be bounded in terms of the number of 
monomials appearing in it (independent of the degree). Define the sparsity of a polynomial P to be the 
number of monomials that occur in it with non-zero coefficients. We will denote it by sp(-P). 
Descartes' Rule of Signs: Let P(X) € M[X] be a univariate polynomial. Then the number of positive 
real roots of P counted with multiplicities is bounded by the number of sign variations in the sequence of 
its non-zero coefficients written in order. In particular, the number of positive roots of P counted with 
multiplicity is bounded by sp(P) — 1. 

Descartes' rule illustrates that for real univariate polynomials, sparsity is an important parameter 
controlling the number of real zeros. It forms the basis of many efficient algorithms for real root counting 
[BPR03] . An important open problem in real algebraic geometry is to find proper analogues of Descartes' 
rule for multivariate polynomials. The topological complexity (as measured by the Euler characteristics 
or the Betti numbers) of the real zeros of a multivariate real polynomial can still be bounded in terms of 
the sparsity of the polynomial independent of the degree [Kho91, lBas99j . However, the known bounds are 
exponential in the sparsity and are believed to be nowhere near tight. A proper generalization of Descartes' 
rule to multivariate polynomials is still elusive and remains a major open problem in real algebraic geometry 
(see [Stu98j and |LT97] for interesting conjectures and counter-examples and [LRW03J for results in special 
cases). A small first step in this direction might be to show tight sparsity bounds for multivariate sign 
representations of parity for arbitrary sets A. 
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1.3 Related Work 



Minsky and Papert prove that representing parity over {0, 1} inputs needs degree n and sparsity 2 n [MP68J. 
Krause and Pudlak [KP95j show that there is a Boolean function / that has exponential sparsity in the 
{—1,1} basis but polynomial sparsity in the {0,1} basis. O'Donnell and Servedio |OS03aj study various 
extremal properties of such representations. The sparsity of random Boolean functions on {— 1, +l} n have 
been studied in |US03al ISak93] . 

2 Preliminaries 

If P{X\, . . . , X n ) £ M[Xi, . . . , X n ], we use P{X\, . . . , X n _i, c) to denote the polynomial in RLXi, . . . , X n ^\] 
obtained by substituting X n = c in P{X\, . . . , X n ). For c G R, the sign of c denoted sgn(c) is +1, —1 or 
depending on whether c is positive, negative or 0. 

The degree of a polynomial P(X\, . . . , X n ) denoted by deg(P) is the maximum of ^ di over all mono- 
mials Y\i Xf 1 that occur in the support of P(-X~i, . . . , X n ). The degree in the variable which is denoted 
degj(P) is the maximum of di over all monomials in the support of P(Xi, . . . ,X n ). A multilinear poly- 
nomial is one where degj(P) < 1 for all i. The sparsity of a polynomial P denoted sp(P) is the number 
of non-zero monomials in its support. We also define the sparsity in the variable Xi which we denote 
spj(P) to be the number of distinct powers of Xi that occur in P(X±, . . . ,X n ). Note that this is different 
from the number of monomials in which Xi appears. Given a function / : A n — > {0, 1} define its com- 
plement / : A n -> {0, 1} by /(aj_, ...,a n ) = 1 - f(a\, ...,a n ). If P(X 1 , . . . , X n ) sign represents /, then 
—P(Xi, . . . , X n ) sign represents /. 

Lemma 2.1 For i G [A;], let Pi(X\, . . . , X n ) be polynomials in M\X\, . . . , X n ] that sign represent f and let 
Ci be positive reals. Then 

Q(Xi, . . . , X n ) = ^2 c-iPi(Xi, . . . , X n ) 
ie[k] 

sign represents f. 

Proof: Let (a±, . . . ,a n ) G A n . Suppose /(ai,...,o n ) = 0. Then since Pj(ai, . . . , a n ) > for all i, 
Q(ai, ...,a n ) = Ya=i CiP(oi, . . . , a n ) > 0. Similarly if f(a x , . . . , a n ) = 1, Q{a u . . . ,a n ) < 0. | 

Similarly, one can show that if the polynomials Pi(X\, . . . , X n ) weakly sign represent /, then Q(X±, . . . , X n ) 
also weakly sign represents /. 

Theorem 2.2 |PS76|, IBPR03] Descartes' Rule of Signs: Let P(X) = Ya=s °i xi he a real univariate 
polynomial. Let s denote the number of sign changes in the sequence cq, c±, . . . , c n . The number of positive 
roots of P{X) counted with multiplicity is bounded by s. 

Let do, ... , dk-i be non-negative integers such that do < ■ ■ ■ < c4_i. Let do, . . . , a.fc-1 be real numbers 
such that ao < ■ ■ ■ < a/c-i- Define the corresponding generalized Vandermonde matrix as 



V = 


t 


d 

"o 

af 


4 1 ■ 

at • 


dk—i 
■ «0 

. at' 1 


\ 


(i) 




V 


a d ° 
a k-l 






) 





Our goal is to determine the signs of the entries in the inverse of such a matrix. For this we will use the 
following lemma: 
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Lemma 2.3 JPSTff If en > for all i, then det(V) > 0. 



Proof: The proof is by induction on k. The case k = 1 is trivial. Assume that the statement holds up to 
k — 1. Now consider the univariate polynomial in M.[X] defined as 



C(X) = 


do 

"o 

4° 


di 

a o ■ 

di 

a l ■ 


■■ < 




X do 


X dl . 


X d k-1 



Let C{X) = c fc _iA" dfc -! + c k _ 2 X dk - 2 H h c X d ° . The sparsity of C{X) is bounded by k, hence by 

Descartes' rule, it has at most k — 1 positive roots. But ao, • • • , afc-2 are roots of C(X). Hence there are 
no other roots. Hence the sign at a^-i (or at any point to the right of cik-2) is the same as the sign at 
+00. This in turn is the sign of the leading coefficient c^-i of C(X), which is 



Cfc-l 



'0 

-,d 
l k~2 



dk-2 
l k-2 



dk-2 
l k-2 



which is positive by the induction hypothesis. 



Let V- 1 = (v~j) denote inverse of V . Using Lemma 12.31 and the formula for inverse of a matrix, it is 
easy to see that for < i,j < k — 1, sgn(t>^ 1 ) = (— 

We will need to consider the case when ao = 0. If do > 0, then clearly the first row is all 0s and the 
determinant vanishes. On the other hand, if do = we get the matrix 



W 



( 1 
1 
1 

V 1 











dk- 



dk-i 
l k-l 



\ 



J 



(2) 



Lemma 2.4 Let W 1 = (w^) denote the inverse of the matrix W. For < i,j < k — 1, 



sgn(W- ) 




ifi = 0,j > 1, 
otherwise. 



Proof: The minors W, 
the entries in W 

that they are positive. Also det(W) > 0, hence by the formula for matrix inverses, sgn(u>. 



ij for j = and i > 1 are since their top row consists entirely of 0s. Hence 
1 for i = and j > 1 are 0. For the other minors, we can apply Lemma 12.31 to show 



Kl 



(-!)•+'. I 



3 Lower Bounds 

We first consider the case when A = {0, 1}. Assume that P(X\, . . . , X n ) sign represents parity over {0, l} n . 
If the variable X takes values in {0, 1}, then X k = X for k > 2. So we can use the relation Xf = X\ for 
k > 2 to reduce the polynomial P(X\, . . . ,X n ) to a multilinear polynomial. These substitutions can only 
decrease sp(P) and deg(P). 
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Lemma 3.1 If P(X\, . . . ,X n ) sign represents parity over {0, l} n , 

P(X±, . . . , X n ) = X n Qi(Xi, . . . , X n _i) + Qo(Xi, • • • , X n ~i) (3) 

where Qq(Xi, . . . , X n —\) and —Q\(X\, . . . , X n -\) sign represent parity on n — 1 variables. 

Proof: Since P(Xi, . . . ,X n ) is multilinear, by grouping together monomials which involve JQ, we can 
write 

P(Xi,...,X n ) = X n Qi(Xi, . . . ,X n _i) + Qo(Xi, . . . ,X n _i) 
By substituting values for X n , we get 

P(X\, . . . , X n _i, 0) = Qo(X\, . . . , X„_i), 

P(Xi, . . . , X n _i, 1) = Qi(-X"i, • • • , X n -i) + Qo(^i) • • • 5 Xfi-l)- 

We now use the so-called self-reducibility of the parity function: 

PAR(ai, . . . ,a n _i,0) = PAR(ai, . . . ,a n -i), 
PAR(ai, . . . ,a n _i, 1) = PAR(ai,. . . ,a n -i). 

From this it follows that P(Xi, . . . , X n _i, 0) = Qo(Xi, . . . , A" ra _i) and — P(Xi, . . . , X n _i, 1) sign represent 
parity on n — 1 variables. Also, we have 

— Qi(Xx, . . . ,X n -i) = P(Xi, . . . ,X n -i, 0) — P(Xi, . . . ,X n -i, 1). 

Hence Qi(X\, . . . , X n -%) sign represents parity by Lemma [2TT1 | 

The polynomial — sign represents parity over {0, l} n . We will show that the degree and 

sparsity cannot be lower for any sign representation. 

Theorem 3.2 [MP 68] If P{X\, . . . ,X n ) sign represents parity over {0, l} n , then it must have degree n 
and sparsity 2 n . 

Proof: Observe that the sparsity bound of 2 n implies that every monomial including Yli=i X% has a 
non-zero coefficient, hence the degree is n. So it is sufficient to prove the sparsity bound. 

The proof is by induction on n. For n = 1, let P{X\) = aX\ + b. P(X\) must satisfy the conditions 

P(0)=6>0, P(l)=a + b<0. 

This implies b > and a < — b < 0, hence sp(P) = 2. 

Assume inductively that the claim holds for n — 1 variables. Write P(X\, . . . ,X n ) as in Lemma 13.11 
Observe that sp(P) = sp(Qo) + sp(Qi), since there cannot be cancellations between the monomials in 
X n Qi(Xi, . . . ,X n -i) and Qq(X±, . . . ,X n -%). By the induction hypothesis sp(Qo) = sp(Qi) = 2 n_1 , hence 
sp(P) = T. | 

We can strengthen the claim to show that the sign of the coefficient of every monomial is fixed. For 
Sd [n], we denote the coefficient corresponding to the monomial n«g5^ by c s- Thus 

p(x u ...,x n ) = j2 csiiXi. 

SC[n] ieS 
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Theorem 3.3 If P(X\, . . . ,X n ) sign represents parity on {0, l} n , then sgn(cs) = (— l)' 5 '. 

Proof: The proof is by induction on n. The case n = 1 follows from the Proof of Theorem 13.21 Assume 
inductively that the claim holds for n — 1 variables. Write P(Xi, . . . , X n ) as in Lemma [3. 11 The monomials 
involving X n come from X n Q\{X\, . . . , X n _i) while those not involving X n come from Qq(X\, . . . , X n —i). 
Now consider S C [n] such that n ^ S. The coefficient cs in P(Xi, . . . , X n ) is the same as the coefficient in 
Qo(Xi, . . . ,X n _x). Since Qa(Xi, . . . ,X n _i) represents parity on n — 1 variables, hence sgn(cs) = (— 
by the induction hypothesis. For S C [n] such that n G S, the coefficient eg in P(Xi, . . . ,X n ) is equal to 
the coefficient c s \^ n y in Qi(X\, . . . ,X n ^±). Since Qi(X±, . . . ,X n -i) represents the complement of parity, 
sgn(c5) = — (— l)! 5 ' -1 = (— l)\ s \ by induction. | 

One can similarly show a bound on the sum of the coefficient sizes for polynomials with integer coeffi- 
cients. We omit the proof. 

Next we generalize Theorem 13.31 to the case when A = {0, . . . ,m — 1} and the degree in each variable 
is at most m — 1. To construct a polynomial sign representing parity satisfying these conditions, for 
< j < m — 2, let ay = j + |. Let 

n m—2 

p{x u ...,x n ) = nnnrn-")) 

t=l j=0 

It can be verified that P(X\, . . . , X n ) indeed sign represents parity on A n and sp(P) = m n . 

Define the univariate polynomial M(X) G R[x] by M(X) = W-^iX-j). Note that M(X) is a monic 
polynomial of degree m which vanishes on the set A = {0, ... ,m — 1}. By Euclidean division, for any 
d > m, we can write 

X d = Q d {X)M(X) + R d (X) 

where deg(Rd) < m — 1. 

The polynomials M(JQ) for i G [n] vanish on the set A n . Given any polynomial P'(X\, . . . ,X n ) which 
sign represents parity over A n , we can reduce P'{X\, . . . ,X n ) modulo the polynomials M(JQ) using 

Xt = R d (Xi) (modM(X i )) 

to obtain a polynomial P(X\, . . . , X n ) such that 

P(X U ...,X n ) = P'{X U ...,X n ) (mod M{X{), . . .,M(X n )) 

The polynomial P(Xi, . . . , X n ) agrees with P'(X\, . . . , X n ) over the set A n , and degj(P) < m — 1. However 
we will show that such polynomials where degi(P) < m — 1 require sparsity m n . 

Lemma 3.4 Assume that P(X\, . . . , X n ) sign represents parity over {0, 1, . . . , m— l} n . Ifdeg n (P) < m—1, 
then 

m—1 

P(X l ,...,X n ) = ^ X^QiiXi, Jf n _i) (4) 

where (—l) l Qi(Xi, . . . ,X n _i) represents parity on n — 1 variables. 
Proof: Since deg n (P) < m — 1, grouping monomials by powers of X n , 

m—1 

P(Xi, . . . , X n ) = ^ X % n Qi{Xi, . . . , X n _i). 

i=0 
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Let 



W 



( 1 ... \ 

11 ... l m - 1 

1 

\ 1 m-l ... (m-l)™- 1 ) 



By substituting values through m — 1 for X n , we get 



/ QoC^lj • • • >-Xn-l) \ 

Qi(Xi, . . . , X n -x) 

\ Qm-l(Xi, . . . , X n -i) J 

( Qo(X\, . . . ,x n ^i) ^ 
Qi[Xi, . . . , X n _i) 

\ Qm-l(^lvi^n-l) / 



/ P(Xi,...,X n „i,0) 

P(Xi, . . . , X n _x, 1) 
V P(X 1 ,...,X n _ 1 ,m-l) J 

( P(X 1 ,...,X n - 1 ,Q) 



\ 



P(Xi, . . . , X n _i, 1) 
\ P(X 1 ,...,X n _ 1 ,m-l) J 



We now expand the LHS. Consider the top row of W , which is indexed by i = 0. By Lemma 12.41 the 
first entry is some number w^q > 0, and the other entries are 0. This implies 

Qo(X\, • • • j X n -i) = w qP(Xi, . . . , X n _i, 0), 

so Qo(X\, . . . , X n ^i) sign represents parity on n — 1 variables. 
For i > 1, = (— l) l+J |u;^ |. Hence 

m— 1 

Qj(Xi, . . . , X n _i) = 

(-lrQ^Xi,...,^!) = 

We now use the self-reducibility of the parity function: 
PAR(ai, . . . ,a n _i, j] 



^2(-l) i+j \wT j 1 \P(X 1 ,...,X n ^j) 



j=0 
m—1 



Y,{-iy\w-^\p(x l ,...,x n ^j). 



(5) 



3=0 



fPAR(ai, . . . , a n _i) if j = (mod 2) 
jpAR(a l5 ...,a n _i) if i = 1 (mod 2) 



Hence the polynomial (— l) 3 \wij\P(Xi, . . . , X n -\,j) sign represents parity onn-1 variables for all j. Hence 
by Lemma l2.lt (— l) l Qi(Xi, . . . , X n ) also represents parity onn-1 variables for every i. | 



Theorem 3.5 Let P(X\, . . . , X n ) be a polynomial that sign represents parity over {0, 1, . . . , m — l} n , with 
deg(Xj) < m — 1 for all i. Then P(Xi, . . . ,X n ) has sparsity m n and the sign of the coefficient of the 
monomial Y[j X % ? is (— 1)^* J . 

Proof: The proof is by induction. The base case n = 1 is an application of Descartes' rule. Let 

m— 1 

P{X 1 ) = £ CiX{ 

i=0 
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Let < k < m — 2. Since P{k) and P(k + 1) have opposite signs, P(X\) has a root a k in the interval 
(k, k + 1). Since the degree of P{X\) is bounded by m — 1, 

m-2 
k=0 

To determine the sign of c m _i, substitute X\ = 0. 

m-2 

p(o) = (-i)"- 1 ^-! n «* 

fc=0 

Since P(Xi) represents parity, -P(O) > 0. Since all the au are positive, we must have sgn(c m _i) = (— l) m . 
Now applying Descartes' rule, since P has m—1 positive roots, there must be m sign changes in the sequence 
Co, ... , c m _i. Hence, sgn(cj) = (— 1)\ This implies that sp(P) = m. 

The inductive case proceeds using Lemma 13.41 exactly as in Theorem 13.21 We skip the proof. | 

Corollary 3.6 If P'(X\, . . . , X n ) sign represents parity over {0, 1, . . . , m — 1}™, then deg(P') > n(m — 1). 

Proof: We quotient out the P'(X 1 , ...,X n )by the polynomials M . . . , M(X n ), to get P(Xi, X n ) 
where degj(P) < m — 1. Note that this only reduces the total degree, hence deg(P') > deg(P). By Theorem 
13.51 sp(P) > m n . This implies that for every tuple (di, . . . , d n ) where d{ < m — 1, the monomial Y\i Xi i 
occurs with non-zero coefficients. Thus the monomial Hig[n] X™"" 1 is in the support, which implies that 
deg(P) > n{m — 1). | 

The same proof extends to sets of the form A = {a, a + 1, . . . , a + m — 1} for a > 0. This implies the 
following corollary, by taking a = 1 and m = 2. 

Corollary 3.7 If P(X\, . . . , X n ) is a multilinear polynomial that sign represents parity over {1, 2} n , it has 
sparsity 2 n . 

A natural question is what happens to the sparsity if we allow polynomials of higher degree. It might 
be that there are polynomials of high degree and low sparsity and quotienting by the M(Xj)s causes the 
sparsity to increase. We will address this question in Section 4. We next turn our attention to weak 
representations. 

3.1 Weak Representations 

We first consider weak representations for parity with low degree. Over {0, l} n , the polynomial P{X±, . . . , X n ) = 
{—l) n Y\iXi gives a weak representation with sparsity 1, and in fact this is optimal with regard to degree 
too. 

Lemma 3.8 [ABFR94J Any polynomial that weakly sign represents parity over {0, l} n has degree n. 

We show that over A = {0, 1, . . . , m— l} n , a lower bound of (m— l) n still applies for weak representations 
when the degree in each variable is at most m — 1. 

Lemma 3.9 If P(X\, . . . , X n ) weakly sign represents parity over A = {0, 1, . . . , m — l} n ; and i/deg n (P) < 
m — 1, then 

m—1 

P(Xx,...,X n ) = J2 x iQi(XiT--,Xn-i) (6) 

i=0 

where for i > 1, the polynomial (—l) l Qi(Xi, . . . ,X n _i) weakly represents parity on n — 1 variables. 
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Proof: The proof is similar to that of Lemma 13.41 the difference being that we need to show that the 



polynomials Qi(X±, . . . ,X n -i) do not vanish over the set A n . 
for X n and inverting the Vandermonde matrix, we get 



/ Qo{X\, ■ ■ ■ , X n -i) \ 
QiyXx, . . . , X n _i) 

\ Qm—1 



w 



( P{X Xi 



By substituting values through m — 1 

• ,X n _i,0) \ 
• , X n _i, 1) 

.,X n -i,m- 1) / 



For i > 1, by Equation ©, we have w-- ^ and 



(-l) l Q i (X 1 ,...,X n ^ 1 ) 



m—1 

J2(-iy\ w r j 1 \p(x 1 ,....,x > 

3=0 



n—1 



Since P{X\, . . . ,X n ) weakly represents parity on A n , the polynomial (— l) 3 P(Xi, . . . ,X n -i,j) either 
weakly represents parity, or it vanishes over A 11 ' 1 . Since P{X\, . . . , X n ) is a weak representation of parity, 
it does not vanish on A n . Hence there is a point (a±, . . . ,a n ) E A n so that P(a\, . . . ,a n ) ^ 0. Hence 
the polynomial P{X\, . . . , X n _i, a n ) does not vanish over A n ~ l . Hence by Lemma 12.11 the polynomials 
(—iyQi(Xi, . . . , X n _i) weakly sign represent parity on A n ~ x for i > 1. | 

The condition i > 1 in the statement of Lemma l3.9l is in fact necessary: take the polynomial P(X\, . . . , X„ 
(— l) n Yii Xi that weakly sign represents parity on {0, l} n . In this case, Qq{X\, . . . , X n _x) = 0, so it does 
not represent parity even weakly. 

We use Lemma this to show a lower bound of (m — l) n on the sparsity of weak representations over 
A n . The base case n = 1 is proved using Lemma [ 



Lemma 3.10 Any univariate polynomial P(X) G M.[X] that weakly sign represents parity over {0, . . . , m— 
1} must have m — 2 roots in the interval (0,m — 1]. 

Proof: We first show that deg(P) > m — 1. Assume that this is not so. Let X±, . . . ,X m _\ be variables 
that take values in {0, 1}. Then the polynomial 

m—1 



Q(X 1 ,...,X m -i) = P(J2x i 



i=i 



weakly sign represents parity on {0, l} m_1 and deg(Q) < m — 1, contradicting Lemma [ 

Now consider the factorization of P(X) over the reals. Assume that this contains an irreducible 
polynomial D(X) £ R[X] with deg(D) = 2. The polynomial D(X) does not have real roots its sign stays 
unchanged in the interval [0, . . . , m — 1]. Hence we can replace D{X) by the constant sgn(Z)(0)), and 
get a sign representation of lower degree. Similarly, consider a linear factor of the form {X — a) where 
a ^ [0, m — 1]. Such linear factors can also be replaced by their signs at 0. Further we may assume that 
there is a root of multiplicity at most 1 at 0. If not, we can write P(X) = X k Q(X) for k > 2. The 
polynomial XQ(X) has the same sign at each point in [0, m — 1] and only smaller degree. We are left with 
a polynomial of the form 

P'{X) = H(X - on) Oi€[0,m-1] 

which weakly represents parity over {0, . . . , m — 1}, hence deg(P') > m — 1. Since is a root of multiplicity 
at most 1, at least m — 2 of the roots 014 lie in the interval (0, m — 1]. | 
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Theorem 3.11 Let P(X\, . . . , X n ) be a polynomial that weakly represents parity over {0, . . . , m — l} n . If 
degj(P) < m — 1 for every i E [n], then sp(P) > m n . 

Proof: The proof is by induction on n. When n = 1, by Lemma 13. 101 P(X-\ ) has m — 2 roots in (0, m — 1], 
hence by Descartes' rule, sp(P) > m — 1. 

For the inductive case, we use Lemma 13.91 From Equation © it follows that 

m— 1 



ap(P) > Y, S P(^)- 



i=l 

By Lemma [3. 9 \ the polynomial (— l) % Qi(X\, . . . , X n —i) weakly represents parity on A™ -1 . Hence by induc- 
tion, sp(Qj) > (m — l) n_1 , and so sp(P) > (m — l) n . | 

This bound is in fact tight. 

Lemma 3.12 There is a polynomial P(Xi, . . . ,X n ) that weakly represents parity over {0, ... ,m — l} n 
where deg i (P) < m — 1 for every i G [n], and sp(P) = (m — l) n . 

Proof: Take Q(X±, . . . , X n ) to be a polynomial that sign represents parity on {1, . . . , m — l} n satisfying 
sp(Q) = (m — l) n , and degj(Q) < m — 2. We claim that the polynomial 

P(Xl, . . . , X n ) = Q(Xi, . . . , X n ) • Y\ Xi 

i=l 

weakly represents parity over {0, . . . , m — l} n . This is because, for a\, . . . , a n G {0, . . . , m — l} n , 
sgn(P(ai, . . . ,a n )) 



if etj = for some i, 

sgn(Q(ai, . . . , a n )) otherwise. 



Further, since degj(Q) < m — 2, degj(P) < m — 1 for all i G [n]. Also, sp(P) = sp(Q) = (m — l) n . | 

The proof of Lemma [3 . 1 2 1 crucially uses the fact that G vl. Indeed we will show that if A = {1, ... , m}, 
then weak representations of parity require sparsity m n . 

Corollary 3.13 Let P{X\, . . . , X n ) be a polynomial that weakly represents parity over {l,...,m} n . If 
degj(P) < m — 1 /or every i E [n], then sp(P) > m n . 

Proof: Let P(X\, . . . ,X n ) be as above. The polynomial 

n 

P'{X 1 ,...,X n ) = P(X 1 ,...,X n )-l[X i 

i=l 

weakly represents parity over {0, . . . ,m} n . Further, since degj(P) < m — 1, degj(P') < m for all i G [n]. 
Hence we can apply Theorem 13.111 which implies 

sp(P) = sp(P') > m n . 
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4 Upper Bounds 

Does the lower bound of m n in Theorem 13.51 hold for all polynomials? Or are there polynomials with higher 
degree but lower sparsity? We show that such a tradeoff is indeed possible. 

Theorem 4.1 There exists a polynomial P(Xi,...,X n ) that sign represents parity over {l,2} n with 
deg(P) = n 2 and sp(P) = n + 1. 

Proof: Define w : {l,2} n — » {2 k }^ =0 by io(oi, . . . , a n ) = Hi a «- ^ w(a\, . . . ,a n ) = 2 k , then = 2 for 
exactly k co-ordinates i G [n] hence PAR(ai, . . . , a n ) = n — k (mod 2). 
Choose points ay G (2 J_1 , 2 J ) for 1 < j < n and let 

n / n \ 

P(X 1 ,...,X n ) = ]J mXi-aj A, 

j=l \i=l J 

so that deg(P) = n 2 and sp(P) = n + 1. We claim that P{X\, . . . , X n ) sign represents parity on {1, 2} n . 
Note that 

n 

P(ai, . . . ,a n ) = JJ(w(ai, . . . ,a n ) - Qj). 

3=1 

If w(ai, . . . , a ra ) = 2 k , then 



P(a 1 ,...,a n ) = n(2"-a J ) 

3=1 

=> sgn(P(a 1 ,...,a n )) = (-l)"- fc . 
Thus the polynomial P(Xi, . . . , X n ) sign represents parity. Its sparsity is n + 1 and its degree is n 2 . | 

In contrast, Corollary 13.71 shows a lower bound of 2 n on the sparsity for sign representations by multi- 
linear polynomials. 

We can extend Theorem 14.11 to show that for any set A of non-negative integers of size m, there are 
polynomials that weakly sign represent parity whose sparsity is less m n , but which have high degree. 

Theorem 4.2 For any set A of non-negative integers of cardinality m, parity can be weakly sign repre- 
sented over A n by a polynomial P(X±, . . . ,X n ) that has sparsity at most ( n+ ™ -1 ) and degree n ■ ( n+ ^~ 1 )- 

Proof: Define the function w : A n — > Z by w(a±, . . . , a n ) = ]T. o«. This maps A n to a set S of size at 
most ( n+ ™~ ) in Z. Let a denote the largest integer in the set A. Note that w(a, . . . , a) = a n is the largest 
integer in S. Further (a, . . . , a) is the unique point in A n that is mapped to a n by w. We claim that the 
polynomial 

p(x l ,...,x n )=(-ir- n (n*-° 

a<=S\{a n } \ie[n] 

weakly represents parity on A n . To prove this, note that 

P(a 1 ,...,a n ) = (-l) na ■ ]"[ (w(a 1 ,...,a n ) - a) 

a£S\{a n } 
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Hence the polynomial vanishes for every point in A n \ {(a, . . . , a)}. Further 

P(a,...,a) = (-ir- J] (a n -a) 

a£S\{a n } 

^sgn(P(a,...,a)) = (-ir. 
Thus P(Xi, . . . , X n ) weakly represents parity on A n . | 

In general sp(P) = \S\ and \S\ depends on the set A. For some sets A, sp(P) can be significantly 
smaller that the bound stated ( n+ ™ _1 ) stated above. In the case when A = {0, ... ,m — 1} and n is a 
fixed constant, one can in fact show that sp(-P) = o(m) n . This is a consequence of Erdos' multiplication 
table theorem which states that the number of distinct integers less than m n which can be expressed as 
the product of n numbers each less than m is o(m) n [BPV98J. 

5 Lower Bounds without Degree Restrictions 

We will now show a lower bound which holds for all polynomials strongly representing parity on {1, ... , m} n 
without any restrictions on the degree or sparsity of each variable. The proof is a generalization of the 
proof idea of Theorem 13.51 

Theorem 5.1 Let P(Xi, . . . ,X n ) be a polynomial which sign represents parity over {1, . . . ,m} n . Then 
sp(-P) > n(m - 1) + 1. 

Proof: The proof is by induction on n. When n = 1, the claim follows by Descartes' rule. Assume it is true 
for n — 1. Recall that sp n (P) is the number of distinct powers of X n that occur in monomials in the support 
of P. If we set all the other variables to 1, the univariate polynomial Q(X n ) = (— l) n_1 P(l, . . . , l,X n ) 
sign represents parity on {1, . . . ,m}, hence it must have sparsity at least m. Hence sp n (P) > sp(Q) > m. 
If k > n(m — 1) + 1, there is nothing to prove. Hence we may assume m < k < n(m — 1). 
Grouping monomials in P(X\, . . . , X n ) by the power of X n they contain, we can write 

k 

P{X 1 ,...,X n ) = J2 X nQi(Xl,---,Xn-l)- 

i=l 

By substituting values 1 through m for X n , we get 



/ 1 


■ 1 


1 


■ 1 




( Qi(x u . 


■ , x n _x) 




( P(Xi,. 


■ ■ 7 X n _l 


1) 




2 dl 






• 2 dk 




Q2(x l7 . 


■ , X n -x) 




P(Xi,. 


■ ■ 7 X n -l 


2) 




\ m dl 


• m dm 




■ m dk 


) 


\ Qk(X\, . 


■ , X n -i) 


) 


V p(x u . 


■ ■ j X n -i 


m) 


) 



(7) 



We denote the mxk matrix by A. While we cannot prove that each Qi(X\, . . . , X n _i) represents parity (or 
its complement), we will show that appropriate linear combinations of the Qi(X\, . . . , X n _i) sign represent 
parity. We pre- multiply each side of Equation © by [/, which is the inverse of the m x m generalized 
Vandermonde matrix consisting of the first m columns of A. 



( l ■ 


• 


&l,m+l 


■ h,k 


\ 


( Qi(X lf . 


■ ■ 7 X n -\) 


\ 




( P(Xi,. 


■ 7 X n -l 


1} \ 


• 


• 


i>2,m+l 


■ b 2 ,k 






■ ■ 7 X n -\) 




= u ■ 


P(X l7 . 


■ 7 X n -l 


2) 


\o • 


• 1 




bm^k 


) 


\ Qk(Xi, . 


■ ■ 7 X n -l) 






{ P(X U . 


■ j X n — i 


m) / 
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Using the sign alternations of the entries of U, we conclude that for 1 < i < m the polynomials 

k 

Ri(Xi, . . . ,X n _i) = Qi(Xi, . . . ,X n -x) + ^2 bijQj(X\, . . . ,X n -i) 

j=m+l 

sign represent parity or its complement on n — 1 variables. Hence by applying the induction hypothesis, 

sp{Ri) > (n-l)(m-l) + l. 

But we also have 

k 

sp(flO < sp(Q 4 ) + s P(^i)- 

j=m+l 

Hence we get 

k 

sp(Qi)+ Yl s P(^i) ^ (n-l)(m-l) + l. 
j=m-yx 

By choosing the matrix C7 to be the inverse of an appropriate sub-matrix, we can obtain a similar equation 
for any subset of the QiS of cardinality k — m + 1. There are ( m _i) such subsets. Each Qi occurs in exactly 
(m-i) °f them. Hence we get 



V 7 8=1 



^ )((» l.)(m l)+ U 
m—1; 



k 

E s p(^) ^ r— ; — r((n-l)(m-l) + l) 



— m + 1 

8=1 



The quantity monotonically decreases as increases. In the range m < k < n(m — 1), it is always 

n(ro— i)+l 

(n-l)(m-l)+l 



greater than .^T/ 1 ^^ which is the value it takes for k = n(m — 1) + 1. Hence 



^sp(Qj) > n(m -1) + 1 



i=l 



But sp(P) = X]f=i s P(Qi); hence the claim is proved. | 



Corollary 5.2 ylriy polynomial that sign represents parity over {1, 2} n musi /iaue sparsity at least n + 1. 

This follows by substituting m = 2 in Theorem 15.11 This shows that the construction of Theorem 14.11 is 
optimal with regard to sparsity. We can now prove tight lower bounds on polynomials sign representing 
parity on A n for any set A of size 2. Let A = {a, b} where < a < b. 

• If a = 0, then any polynomial which sign represents parity has sparsity at least 2 n . 

• If a > 0, then any polynomial which sign represents parity has sparsity at least n + 1. 

While the lower bound of n(m — 1) + 1 in Theorem 15.11 is tight for m = 2, this is far from the upper 
bound of Theorem 14. II for large m. It would be interesting to close this gap. 
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6 Circuit Lower Bounds 



We shall use bounds on the sparsity of parity to derive lower bounds on the size of certain restricted 
circuits. The circuits we consider are rather weak, however the proof of the lower bound is simple and 
yields better parameters than were previously known. 



Definition 6.1 A Threshold function f : {0, l} n — > {0, 1} is defined as 

/(ai, . . . ,O n ) : 



1 ifwo + Ya=1 w i a i < 
ifw + Ya=1 w i a i > 



where Wq, . . . , w n G R. The coefficients Wi are called the weights of the Threshold function. A gate 
computing a threshold function is called a Threshold gate and is denoted by THR. 

In our definition, we assume that wq, . . . , w n are such that wo + Ya=1 w i a i / for ai, . . . , a n € {0, 1}™. 

Definition 6.2 A Threshold of Ands circuit (denoted THR o AND J is a depth-two circuit with Boolean 
inputs X\ , . . . , X n . The top level of the circuit consists of a single THR gate, while the bottom level 
consists of AND gates. The inputs to the AND gates are the inputs X%, . . . ,X n and their complements 
->Xi, . . . , —*X n . The size of a THR o AND circuit is defined to be the number of AND gates in the bottom 
level of the circuit. The minimum size of a THR o AND circuit needed to compute a Boolean function f 
is denoted by S(f). 

These circuits are well-studied (see |Gol97| and the references therein). By De Morgan's law, such 
circuits can simulate OR gates at the bottom level. We will show that S(f) corresponds to minimum 
sparsity required to sign represent / over a certain basis. Thus proving circuit lower bounds is equivalent 
to proving bounds on the sparsity of sign representations. 

To begin with, assume that the inputs to the AND gates were only the variables X{, not their comple- 
ments. Each AND gate computes a function of the form Jligyi Xi where A is the set of inputs into the gate. 
Such a circuit computing parity corresponds to a sign representation of parity in the standard monomial 
basis. The number of AND gates is exactly the number of non-constant monomials required. By Theorem 
IO this is 2™ - 1. 

In a general THR o AND circuit, and AND gate computes the function Aj g /Aj A je j —>Xj. We can 
assume that / H J is empty, else the AND gate computes the function 0. Thus the AND gate computes 
the polynomial 

B{x 1 ,...,x n ) = Hx i H(i-x j ), lnJ = (8) 

iel jeJ 

Let B n denote the set of all such polynomials taken over all choices of the sets I and J. It is easy to 
show that \B n \ = 3 n . Since B n contains the standard monomial basis, so it spans the M- vector space of 
multilinear polynomials in . . . ,X n ]. Since this vector space has dimension 2 n , there are many ways 

to write a multilinear polynomial P(Xi, . . . ,X n ) as a linear combination of polynomials in B n . We will 
define spg(-P) as the minimum possible sparsity over all such linear combinations. Formally: 

Definition 6.3 Let P(Xi, . . . ,X n ) be a multilinear polynomial in M.[Xi, . . . ,X n ]. We define the sparsity 
of P(Xi, ... , X n ) over B n as 

k 

sp B (P) = {minA; j P(X U . . . ,X n ) = ^c^pTi, . . . ,X n ), Bi{X x , . . . , X n ) G B n }. 

i=i 
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The following lemma relating circuit-size for THR o AND circuits computing / and sparsity over B of 
polynomials that sign represent / follows from the preceding discussion. 

Lemma 6.4 For any Boolean function f : {0, l} n — » {0, 1}, 

S(f) = minsp e (P) 

over all polynomials P(X\, . . . , X n ) that sign represent f. 

Theorem 6.5 Every THR o AND circuit computing the parity function on {0, l} n has size at least (§) • 

Proof: We will show that if P(X\, . . . ,X n ) sign represents parity, then sp B (P) > (§) n . 

The proof is by induction on n. For n = 1, B\ = {Xi, 1 — X\, 1}. Since none of these polynomials or 
their multiples sign represents parity on 1 variable, spg(P) > 2. 

Now assume the claim holds for n — 1. Let P(X\, . . . , X n ) sign represent parity. Consider the sparsest 
representation of P over B. 

p(x u ...,x n ) = Yl ^jU^Ui 1 - x j) 

inj=<j> iel jeJ 

Grouping together monomials where X n appears, monomials where (1 — X n ) appears, and those where 
neither appears, we get 

P(Xx,...,X n ) = X n A(X 1 ,...,X n -i) + (l-X n )B(X u ...,X n .. 1 )+C(X 1 ,...,X n - 1 ). (9) 

The best (sparsest) way to write P(X±, . . . , X n ) as a linear combination of polynomials in B n is to use the 
best (sparsest) expression for each of A(X\, . . . , X n _i), B(X\, . . . , X n -x) and C(X\, . . . , X n —i) as linear 
combinations of polynomials in B n -\. Hence 

sp B (P) = sp B (^) + sp B ( J B) + s Ps (C). 

Substituting for X n in Equation Q, 

P(Xi, . . . , X n -i, 0) = B(Xi, . . . , X n _i) + C(X\, . . . , X n -i), 
P(Xi, . . . , X n _i, 1) = A(Xi, . . . , A" n _i) + C(Xi, . . . , X n -i), 
P(Xi, . . . , X n _i, 0) — P(Xi, . . . , X n _i, 1) = B(Xi, . . . , X n _i) — A(Xi, . . . , X n -i) 

All the polynomials on the LHS represent either parity or its complement on n — 1 variables. By applying 
the induction hypothesis, 

sp B (B) + sp B (A) > sp B (B-A) > (3/2)"" 1 , 
S p B (B) + sp B (C) > sp B (B + C) > (3/2r"\ 
sp B (A) + S p B (C) > S p B (A + C) > (3/2)- 1 . 

Adding these equations, we get 

sp B (P) = sp B (A) + sp B (B) + sp B (C) > (3/2)™ 

which completes the proof of the Theorem. | 
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Proposition 6.6 There is a THR o AND circuit computing the parity function on {0, 1}™ of size 0(5 3 ). 
Proof: The polynomial 

Q(X 1 ,X 2 ,X 3 ) = X ± X 2 X 3 + - X 2 )(l - X 3 ) + X 2 (l - X 3 )(l - X X ) + X 3 {\ - Xi)(l - X 2 ) 

exactly represents parity on {0, l} 3 . Hence, the polynomial P(X±, X 2 , X 3 ) = 1 — 2Q(X\, X 2 , X 3 ) sign 
represents parity on 3 variables and has sparsity 5. Hence the polynomial 

21 
3 

R(X) = l[P(X 3 ^ 2 ,X 3 ^ 1 ,X 3i ) 

i=l 

sign represents parity on n variables, and sp B (R) = 5 s . | 

Definition 6.7 The inner product function IP : {0, l} n x {0, l} n — > {0, 1} is defined as 

IP(ai, . . . ,a n ,6i, . . . ,6 n ) = ^Ja^ (mod 2). 

i 

Theorem 6.8 Every THR o AND circuit computing the inner product function on {0, l} n x {0, l} n has 
size at least 2 n . 

Proof: The proof is by induction on n. The base case is trivial. Assume the claim holds for n — 1. 

Let P(Xi, . . . , X n , Yi, . . . , Y n ) sign represent IP on {0, l} n x {0,1}™. Consider the sparsest way to 
write P(Xi, . . . , X n , Yi, . . . , Y n ) over B 2n . Grouping the monomials according to X n , Y n , where each Ai is 
a polynomial in Xi, Y\, . . . , X n ^x,Y n ^x, 

P{X U ...,X n ,Yi,...,Y n ) = X n Y n A x + X n {\ - Y n )A 2 + (1 - X n )Y n A 3 + (1 - X n )(l - Y n )A A 

+X n A 5 + Y n A 6 + (1 - X n )A 7 + (1 - Y n )A 8 + A 9 

Now substituting for X n , Y n and writing P(0, 1) for P{X\, . . . , A n _i, 0, Y\, . . . , Y n -\, 1) and so on, 



P(0,1) = A 3 + A e + A 7 + A 9 , (10) 

P(1,0) = A 2 + A 5 + A 8 + A 9 , (11) 

P(l,l) = Ax + A 5 + A 6 + A 9 . (12) 

Subtracting Equation (fT2|) from ([10]) and pTI) respectively, 

P(0,1)-P(1,1) = -vli + A 3 -A 5 + A 7 , (13) 

P(1,0)-P(1,1) = -A l + A 2 -A 6 + A 8 . (14) 

In the above equations the polynomials on the LHS represents IP or its complement on {0, l} n_1 x {0, l} n ~ 1 , 



so each has sparsity at least 2 n 1 by the induction hypothesis. Applying this observation to equations [T0| 
ii and [H 

sp B (^ 3 ) + sp B (^ 6 ) + sp B (^ 7 ) + sp B (A)) > 2 n ~\ 
sp B (A 2 ) + sp B (A 5 ) + sp B (A s ) + sp B (A 9 ) > 2 n ~\ 
sp B (Ai) + sp B (^ 3 ) + sp s (A 5 ) + sp B (^ 7 ) > 2 n ~\ 
sp B (Ai) + sp B (A 2 ) + sp B (A 6 ) + sp B (A 8 ) > 2 n -\ 
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Adding these equations, we get 



2 (X> B (A^ - 2sp B (^ 4 ) > 4 • 2 n -\ 



Hence 

9 
i=l 

which completes the proof. | 

Proposition 6.9 There is a THR o AND circuit computing the inner product function on {0, l} n x {0, l} n 

of size 2" . 

Proof: The polynomial 

Q(X 1 ,X 2 ,Y 1 ,Y 2 ) = X 1 Y 1 + X 2 Y 2 - 2X 1 Y 1 X 2 Y 2 
exactly represents IP on {0, l} 2 x {0, l} 2 . The polynomial 

P(X 1 ,X 2 ,Y 1 ,Y 2 ) = 1-2Q(X 1 , X 2 Y U Y 2 ) 
sign represents IP and on {0, l} 2 x {0, l} 2 and spg(P) = 4. For arbitrary n, the polynomial 

n 
2 

R(Xi, . . . , X n , Yx,..., Y n ) = P(X 2 i-\,X 2 i, Y 2 i^i,Y 2 i) 

i=\ 

sign represents IP on {0, l} n x {0, l} n and sp B (i?) = 4t = 2 n . | 
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